Model Selection

Local deployment

# Local deployment

Jan-nano-8bit is an 8-bit quantized version converted from the Menlo/Jan-nano model, optimized for the MLX framework and suitable for text generation tasks.

Large Language Model

Minicpm4 8B Q8 0 GGUF

MiniCPM4-8B-Q8_0-GGUF is a model converted from openbmb/MiniCPM4-8B to GGUF format via llama.cpp, suitable for local inference.

Large Language Model

Transformers Supports Multiple Languages

Chinda Qwen3 4b Gguf

Chinda LLM 4B is a cutting-edge Thai model launched by iApp Technology, built on the Qwen3-4B architecture, bringing advanced thinking capabilities to the Thai AI ecosystem.

Large Language Model

Qwen3 235B A22B 4bit DWQ

Qwen3-235B-A22B-4bit-DWQ is a 4-bit quantized version converted from the Qwen3-235B-A22B-8bit model, suitable for text generation tasks.

Large Language Model

Qwen3 8B 4bit AWQ

Qwen3-8B-4bit-AWQ is a 4-bit AWQ quantized version converted from Qwen/Qwen3-8B, suitable for text generation tasks in the MLX framework.

Large Language Model

Qwen3 30B A3B GGUF

The GGUF quantized version of Qwen3-30B-A3B, supporting multi-bit quantization, suitable for text generation tasks.

Large Language Model

Qwen3 30B A3B 4bit

Qwen3-30B-A3B-4bit is a 4-bit quantized version converted from Qwen/Qwen3-30B-A3B, suitable for efficient text generation tasks under the MLX framework.

Large Language Model

Qwen3 0.6B GGUF

GGUF quantized version of Qwen3-0.6B, suitable for text generation tasks.

Large Language Model

Qwen3 14B MLX 4bit

Qwen3-14B-4bit is a 4-bit quantized version of the Qwen/Qwen3-14B model converted using mlx-lm, suitable for text generation tasks.

Large Language Model

lmstudio-community

OuteTTS is a text-to-speech (TTS) model focused on the Turkish language, based on a 500M parameter scale, capable of converting Turkish text into natural speech.

Speech Synthesis Other

3b Ko Ft Research Release Q4 K M GGUF

This is a 3B-parameter language model optimized for Korean, converted to GGUF format for compatibility with llama.cpp.

Large Language Model Korean

Mistral Small 3.1 24b Instruct 2503 Hf GGUF

This is a GGUF format quantized version of the mrfakename/mistral-small-3.1-24b-instruct-2503-hf model, suitable for text generation tasks.

Large Language Model

Gemma 3 4b Pt Q4 0 GGUF

This is a GGUF format model converted from Google's Gemma 3.4B parameter model, suitable for text generation tasks.

Large Language Model

Llama 3.1 8B RainbowLight EtherealMix GGUF

This is a quantized version in GGUF format based on the Llama-3.1-8B-RainbowLight-EtherealMix model, which facilitates the development of applications related to text generation.

Large Language Model

GGUF format quantized version of QwQ-32B, suitable for local text generation tasks.

Large Language Model

MMS TTS THAI FEMALEV1

This is a Thai female voice text-to-speech (TTS) model, fine-tuned based on the VITS architecture, supporting high-quality Thai speech synthesis.

Speech Synthesis Other

Indri 0.1 124m Tts GGUF

Indri is a text-to-speech (TTS) model supporting English and Hindi, with a parameter size of 124M, optimized for CPU inference in GGUF format.

Speech Synthesis Supports Multiple Languages

Gte Qwen2 7B Instruct GGUF

A large language model developed by Alibaba NLP team, based on the Qwen2 architecture with 7B parameters, supporting instruction interaction

Large Language Model

Mlx Stable Diffusion 3 Medium

MLX implementation of Stable Diffusion 3 Medium, focused on text-to-image generation

Image Generation English

Smollm 135M 4bit

This is a 4-bit quantized 135M parameter small language model, suitable for text generation tasks in resource-constrained environments.

Large Language Model

Transformers English

Deepseek V2 Lite Chat GGUF

DeepSeek-V2-Lite-Chat is a lightweight chat model optimized based on the DeepSeek-V2 architecture, suitable for efficient dialogue generation tasks.

Large Language Model

Gemma 2 27b It Q8 0 GGUF

This is a GGUF format model converted from Google's Gemma 2B model, suitable for text generation tasks.

Large Language Model

Qwen2 7B Instruct GGUF

The GGUF quantized version of Qwen2-7B-Instruct, suitable for local deployment and inference

Large Language Model

Llama 2 7b Ukrainian Q8 0 GGUF

This is a Ukrainian and English language model based on the Llama-2-7b architecture, converted to GGUF format for use with the llama.cpp framework.

Large Language Model Supports Multiple Languages

Meta Llama 3 8B Instruct Q4 K M GGUF

The GGUF quantized version of the Llama 3 8B instruction model, suitable for local inference and supporting efficient deployment

Large Language Model English

Gemma 7B Instruct Function Calling

Gemma is a series of lightweight cutting-edge open-source large language models launched by Google, developed based on the Gemini technology framework, supporting English text generation tasks.

Large Language Model

Tinyllama 1.1B Chat V1.0 GGUF

TinyLlama is a lightweight 1.1B-parameter Llama model optimized for chat and programming assistance tasks.

Large Language Model English

Pandora-v1-13B is a 13B-parameter large language model that integrates multiple 7B models, using the passthrough fusion method to combine the best-performing 7B models from the OpenLLM leaderboard.

Large Language Model

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase